The Fractal Dimension Making Similarity Queries More Efficient

نویسندگان

  • Adriano S. Arantes
  • Marcos R. Vieira
  • Agma J. M. Traina
  • Caetano Traina
چکیده

This paper presents a new algorithm to answer k -nearest neighbor queries called the Fractal k -Nearest Neighbor (k NNF ()). This algorithm takes advantage of the fractal dimension of the dataset under scan to estimate a suitable radius to shrinks a query that retrieves the k -nearest neighbors of a query object. k -NN() algorithms starts searching for elements at any distance from the query center, progressively reducing the allowed distance used to consider elements as worth to analyze. If a proper radius can be set to start the process, a significant reduction in the number of distance calculations can be achieved. The experiments performed with real and synthetic datasets over the access method Slim-tree, have shown that the efficiency of our approach makes the total processing time to drop up to 50%, while requires 25% less distance calculations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Resting-State fMRI Topological Graph Theory Properties in Methamphetamine Drug Users Applying Box-Counting Fractal Dimension

Introduction: Graph theoretical analysis of functional Magnetic Resonance Imaging (fMRI) data has provided new measures of mapping human brain in vivo. Of all methods to measure the functional connectivity between regions, Linear Correlation (LC) calculation of activity time series of the brain regions as a linear measure is considered the most ubiquitous one. The strength of the dependence obl...

متن کامل

ارائه روشی پویا جهت پاسخ به پرس‌وجوهای پیوسته تجمّعی اقتضایی

Data Streams are infinite, fast, time-stamp data elements which are received explosively. Generally, these elements need to be processed in an online, real-time way. So, algorithms to process data streams and answer queries on these streams are mostly one-pass. The execution of such algorithms has some challenges such as memory limitation, scheduling, and accuracy of answers. They will be more ...

متن کامل

Modelling the Self-similarity in Complex Networks Based on Coulomb's Law

Recently, self-similarity of complex networks have attracted much attention. Fractal dimension of complex network is an open issue. Hub repulsion plays an important role in fractal topologies. This paper models the repulsion among the nodes in the complex networks in calculation of the fractal dimension of the networks. The Coulomb’s law is adopted to represent the repulse between two nodes of ...

متن کامل

A new information dimension of complex networks

The fractal and self-similarity properties are revealed in many real complex networks. However, the classical information dimension of complex networks is not practical for real complex networks. In this paper, a new information dimension to characterize the dimension of complex networks is proposed. The difference of information for each box in the box-covering algorithm of complex networks is...

متن کامل

Self-similar fractals and arithmetic dynamics

‎The concept of self-similarity on subsets of algebraic varieties‎ ‎is defined by considering algebraic endomorphisms of the variety‎ ‎as `similarity' maps‎. ‎Self-similar fractals are subsets of algebraic varieties‎ ‎which can be written as a finite and disjoint union of‎ ‎`similar' copies‎. ‎Fractals provide a framework in which‎, ‎one can‎ ‎unite some results and conjectures in Diophantine g...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003